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Abstract. We present a prospective analysis of a combined cosmic shear and cosmic microwave background data 
set, focusing on a Canada France Hawaii Telescope Legacy Survey (CFHTLS) type lensing survey and the current 
WMAP-1 year and CBI data. We investigate the parameter degeneracies and error estimates of a seven parameters 
model, for the lensing alone as well as for the combined experiments. The analysis is performed using a Monte 
Carlo Markov Chain calculation, allowing for a more realistic estimate of errors and degeneracies than a Fisher 
matrix approach. After a detailed discussion of the relevant statistical techniques, the set of the most relevant 2 
and 3-dimensional lensing contours are given. It is shown that the combined cosmic shear and CMB is particularly 
efficient to break some parameter degeneracies. The principal components directions are computed and it is found 
that the most orthogonal contours between the two experiments are for the parameter pairs (tt m , o"s), (h, n 3 ) and 
(n 3 ,a s ), where n 3 and a 3 are the slope of the primordial mass power spectrum and the running spectral index 
respectively. It is shown that an improvement of a factor 2 is expected on the running spectral index from the 
combined data sets. Forecasts for error improvements from a wide field space telescope lensing survey are also 
given. 
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1. Introduction 

The Canada- France-Hawaii Telescope Legacy Survey 1 
(CFHTLS) is a long term wide field imaging project that 
started in early 2003 and should be completed by 2008. 
The French and Canadian astronomical communities will 
spend about 500 CFHT nights to carry out imaging sur- 
veys with the new Megaprime/Megacam instrument re- 
cently mounted at the CFHT prime focus. About 160 
nights will focus on the " CFHTLS- Wide" survey that 
will cover 170 deg 2 , spread over 3 uncorrelated patches of 
7° x 7° each, in u* , g' ', r', i' , z' bands, with typical exposure 
times of about one hour per filter. The "CFHTLS- Wide" 
survey design and observing strategy are similar to the 
ViRMOS-Descart cosmic shear survey 2 but it will have a 
sky coverage 20 times larger. It is widely seen as a typical 
second generation cosmic shear survey. 

The exploration of weak gravitational distortion pro- 
duced by the large scale structures of the universe over 
field of views as large as "CFHTLS- Wide" has an enor- 
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mous potential for cosmology. Past experiences based on 
first generations cosmic shear surveys (see for example 
reviews in IVan Waerbeke &: Mellier 20031 |b!eTregier 2003| ) 
have demonstrated they can constrain the dark matter 
properties (as , and the shape of the dark matter power 
spectrum) from a careful investigation of the ellipticity in- 
duced by weak gravitational shear on distant galaxies. For 
example, the most recent cosmic shear results from the 
ViRMOS-Descart survey i|Van Waerbeke et al. 2004(1 lead 
to the conservative limits eg = 0.85 ±0.15 (99% C.L.) and 
fi m — 0.3 ± 0.15 (99% C.L.), which means an expected 
accuracy of «l-3% can be expected with the "CFHTLS- 
Wide" for the same set of cosmological parameters. The 
CFHTLS- Wide will also explore a broader wavenumber 
range (10 5 -10 2 ) than ViRMOS-Descart and will extend 
to linear scale, that will considerably ease cosmological 
interpretation of weak lensing data. The second gener- 
ation surveys will therefore permit to investigate more 
thoroughly different cosmological models, taking into ac- 
count a broad range of cosmological parameters. For in- 
stance, the CFHTLS as a probe dark energy evolution was 
stressed out by Bcnabcd & van Waerbeke 20031 

The full scientific outcome of the cosmic shear data 
from the " CFHTLS- Wide" will only be complete with 
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a joint analysis with other data sets, like Type la 
Supernovae, galaxy redshift surveys, Lyman-alpha forest, 
or CMB observations. IContaldi et al. 20031 have used the 
Red Cluster Sequence (RCS) cosmic shear survey together 
with the CMB data. It was shown that the Vl m , er 8 de- 
generacies for lensing and CMB are nearly orthogonal, 
which makes this set of parameters particularly relevant 
for such combined analysis l|Van W acrbck e et al. 20 02). 
The search for orthogonal parameter degeneracies between 
different observations is one of the most important aspects 
of parameters measurements. 

Isha k et al. 20031 recently argued that joint CMB- 
cosmic shear surveys provide an optimal data set to ex- 
plore the amplitude of the running spectral index and 
probe inflation models. They used a Fisher-Matrix anal- 
ysis on WMAP+ACBAR+CBI and a "reference survey". 
Their simulated survey covers about 400 deg 2 with a 
depth corresponding to a galaxy number density of lensed 
sources of about 60 arc-min -2 , and they restricted their 
analysis to 3000 > Z >20. They found that several parame- 
ters can be significantly improved (like as-, £l m h 2 , J7a) and 
in particular that both the spectral index n s and the run- 
ning spectral index a s errors are reduced by a factor of 2. 
Their encouraging results show that joint CMB and weak 
lensing data may provide interesting insights on inflation 
models. Here, we investigate the 2-dimensional structure 
of the parameter degeneracies between lensing and CMB 
data sets, and look for the expectation with a "CFHTLS- 
Wide"-like survey design. 

To explore the smaller scales probed by the " CFHTLS- 
Wide" , which will provide cosmic shear information down 
to 20 arc-seconds, it is preferable to avoid prior assump- 
tions regarding the Gaussian nature of the underlying dis- 
tribution, and to discard a Fisher matrix analysis. We 
used in this work the so-called Markov Chain Monte Carlo 
(MCMC) method. The MCMC computing time linearly 
scales with the number of parameters and cases the explo- 
ration of a large sample of parameters and a broad range 
of values for each. Conta ldi et al. 20031 already used this 
approach with the RCS survey to map the Q, m , cr§ param- 
eter space, but marginalised over a small set of cosmolog- 
ical parameters. The goal of this present work is to map 
the parameter space that describes cosmological models 
in order to extract series of parameter combinations that 
would minimise intersections of CMB and cosmic shear de- 
generacy tracks. Compared to the Fisher-Matrix approach 
which produces ellipses only, MCMC provides more details 
of the parameter space and eventually a more realistic es- 
timate of error improvements of the joint analyses. 

The paper is organised as follows: Section 2 intro- 
duces the gravitational lensing and defines the cosmic 
shear fiducial data used and the parameter space inves- 
tigated. Section 3 gives the details of our MCMC calcula- 
tions, limitations and convergence criteria. Section 4 shows 
the MCMC results from the cosmic shear alone, assuming 
a lensing survey similar to the CFHTLS. In Section 5 we 
present the results of the parameter degeneracies analy- 
sis on the combined cosmic shear and cosmic microwave 



background observations. The assumptions made and the 
results obtained are discussed in section 6 and we conclude 
in section 7. 

2. Cosmic shear and Cosmological parameters 

Propagation of galaxy light beams across large 
scale mass inhomogeneities produces distorted 
and (de)magnified galaxy images (for reviews 
see IMellier 19991 IBartelmann fc Schneider 20011 
IRefregier 2003) IVan Waerbeke fc Mellier 2003jl . The 
gravitational lensing magnification and shear are de- 
scribed by the amplification matrix which involves second 
order derivatives of the projected gravitational potential 
Lp : the shear, 7, and the convergence, k, 

K= \ (v,n + ^22) ; 71 = 2 Op. 11 ~ ^22) ; 72 = ^,i2-(i) 

The cosmic shear may be derived from the ellipticity of the 
galaxies. In the weak lensing approximation, the observed 
ellipticity of a galaxy is related to its intrinsic ellipticity 
and to the shear by, 

£j = £i« + 7 > ( 2 ) 

where = (e X i, e^j) and 7 = (71,72). Several correla- 
tion functions and 2-point statistics of the shear may be 
defined. In this work we will use the shear variance in a 
top-hat window of radius 8 (Eq. (JJJ)), although all 2-points 
statistics are equivalent and provide similar results (they 
are all linear combinations of the others). Let us now re- 
late this quantity to cosmology. 

We parameterize cosmological models with a set of 
13 parameters, (u b , w c , Q v , f v , h; a$, n s , a s , r, n t ; r; w; z s ). 
Each model defines a point in the high-dimensional space 
where a value of the likelihood of the model with re- 
spect to the data may be calculated. We use the CAMB 
software IjLewis et al. 2000}) to compute the dark matter 
power spectrum and transfer function. The parameters are 
defined as follow: 

— (A s , n Sl a s ) - parameterize the primordial scalar power 
spectrum: 

P(k) = A s [—) , (3) 

which is normalized with A s = P(k = 0.05/iMpc -1 ) 
and has a scale-dependent tilt. 

— (A s ,r, n t ) - parameterize the primordial tensor power 
spectrum: 

p{k) = rA -(i;T- (4) 

with k Qt = 0.002/iMpc -1 . 

— (a>&, uj c , Cl v , f u ) - the matter budget consisting of 
baryons, cold dark matter, dark energy and neutrinos, 
with u>b = fif, h 2 , tOc — Q c h 2 . 
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u b = 0.022 uj c = O.f 14 Ou = 0.73 

/„ = n k = h = 0.71 

n s (k 0s ) = 0.93 a s (k 0s ) = -0.04 r = 

u> = -l r = 0.17 
cr 8 = 0.9 = 0.8 



Size of the survey: A=170 deg" 2 

Density of galaxies: n g = 20 arcmin~ 

Intrinsic ellipticity dispersion: cr e = 0.4 
Scales probed: 0.6'<6><2deg 

90 < I < 18000 



— (h) - the hubble parameter. The spatial curvature is 
not a free parameter : D,f. = 1 — Cl v — (w c + uji,)/h 2 . 

— (w) - dark energy equation of state parameter. 

— (r) - rcionization optical depth. 

— (as) - Once the 3-dimensional power spectrum of the 
density fluctuations is derived, it is renormalized ac- 
cording to the model value of erg, losing the memory 
of the original value of A s . 

The non-linear evolution of the power spectrum is 
then evaluated using the halofit prescription of 
ISmith et al. 2003 The power spectrum of the gravita- 
tional convergence is derived by integrating the dark mat- 
ter power spectrum along the line-of-sight from the ob- 
server up to the radial coordinate of the horizon xh- 



P m 9 2 [*» g 2 ( X ) p 



k 



(5) 



io ~a 2 (x) \/k(x) 
where k is the 2-dimensional wave vector perpendicular to 
the line-of-sight. /k(x) is the comoving angular diameter 
distance to a coordinate \i and g(x) is given by 



9(X) 



(6) 



where p(x( z )) is the source redshift distribution, which 
depends on the last item in our cosmological parameters 
set: 

— (z s ) - the redshift of the sources. 

For simplicity, we assume that all the sources are at a sin- 
gle redshift, z s , but the generalisation to a broad redshift 
distribution is straightforward. 

From the power spectrum of the gravitational conver- 
gence, the top-hat shear variance inside a circle of radius 
9 can be computed: 



(7) 



2 f°° dk 
(r^)> = ^ / T Pn(k)[Jl(k9)} 2 , 

For each cosmological model generated by the Monte 
Carlo simulation described in the next section, (i 2 (9)) 
is computed and compared to a fiducial data model, 
(7 data W) that corresponds to the fiducial cosmological 
model of Tabled This is taken to be the best fit ACDM, 
flat, no neutrinos, no gravitational waves, running spec- 
tral index model, derived from the first year WMAP data 
HSpergel et al. 2003) . The sources are placed at z s = 0.8. 
The likelihood of each model is given by 

-2C= « 7 2 ) l - <7data>,) Qalaij ((t") 3 - ^data).) , (8) 



where C. 



data ij 



is the covariance matrix 



Cdataij = ({if ~ 7data i) {lj ~ 7data j)) ■ 



(9) 



The covariance matrix of a shear dispersion distribu- 
tion is analytically derived in Schneid er et al. 20021 The 
shear dispersion at a given angular scale 9 may be ex- 
pressed as an integral of the two-point correlation of 
galaxy ellipticities, £+, estimated in bins of angular sep- 
aration A$ at the center position $j, evaluated with a 
window function S+( , &i/9). The covariance of the shear 
dispersion involves the four-point ellipticities correlations. 
Eq. J2J shows they depend on the intrinsic ellipticity corre- 
lations, and therefore on the intrinsic ellipticity dispersion 
a £ , and on the shear correlations for each model. The shear 
field is assumed to be Gaussian, so that the 4-point func- 
tion can be factorised as a sum over products of 2-point 
functions. It is assumed that the covariance matrix de- 
pends solely on the fiducial cosmological model. Assuming 
a connected single field of solid angle A and a mean density 
n g of source galaxies with an intrinsic ellipticity disper- 
sion of cr e , the ensemble average of the covariance matrix, 
Cov(6>; 9i, 92) of the estimator S of (i 2 (9)) is, 



Cov(S) = 



2-KAn 2 g Jo 



2min(e 1 ,9 2 ) ^ ^ 



Ms We) 



+ 

(10) 



1 dtfitfi 



Eq. ("QJJ is the Eq. (42) in ISchneider et al. 20021 trans- 
posed for the dispersion (1 2 ). The first term is a pure 
Poisson noise term, due to the intrinsic ellipticities of the 
galaxies. Its value decreases rapidly with the scale 9. The 
second term is determined by C' ++ (-d 1,1)2), the shear cor- 
relations dependent part of C++ (the covariance matrix 
of £+). It is computed using equations (32) and (34) of 
Schneid er et al. 2"t )02 with the the shear correlation func- 
tion computed from the convergence power spectrum at 
the fiducial model as, 



e±(*) 



1 

2^ 



dkkP K (k)J 0A (k9). 



(11) 



We do not include extra sources of errors, in particu- 
lar we assume that galaxy shape measurement and PSF 
anisotropy corrections are free of systematics. 

Table [5] summarizes the survey properties. These are 
based on real observations; the values for a e and n g (the 
effective density, once galaxy selection is done) are found 
in cosmic shear surveys l|Van Wacrbck e et al. 2002|) and 
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Fig. 1. Shear variance as a function of scale. The bottom left plot shows the fiducial model with 3 a error bars. In the 
other 8 plots, we change the labelled parameter by a step of +10% (solid line) and —10% (dashed line), except for a s 
where the step is ±100%. In each case, the other 7 labelled parameters are kept constant. Flatness is not imposed. 



the field size is the total size of the CFHT Legacy Survey. 
To choose the upper limit on the angular scale, we notice, 
from Eq. IjlOII . that the computation of the covariance of 
the shear dispersion at a scale 9 involves an integration up 
to 2 9. Furthermore, the integrations involved in the com- 
putation of C' ++ need an extra factor of \[2. We define 
9max to be such that 2 \J2 9 max corresponds to the largest 
wavelength that can fit in a field of the survey area. Given 
that the fields of the CFHTLS-Wide survey have an ap- 
proximate size of 7° x 7° , we find 9 ma x to be around 2 . 
The lower limit on the scale probes the deep non-linear 
regime. 

The solid line in the bottom left plot of Fig.^shows the 
shear variance of the fiducial model (Table as function 
of angular scale, along with 3a error bars computed from 
Eq. (|10|l at 20 angular scale points ranging from 0.6 arcmin 
to 2°. The error bars are smaller at intermediate scales, 
slightly larger at the smallest scales, where they are de- 
termined by statistical noise, and are noticeably bigger at 
the largest scales, which are cosmic variance dominated. 
They are slightly optimistic at small angular scales, due to 
the Gaussianity assumption, made in the derivation of the 
covariance, lacking the non-linear enhancement of the sig- 
nal The dashed line shows the shear variance without the 



non-linear corrections. Both lines become well separated 
below 10 arcmin. The other panels on Fig. ^ illustrate the 
cosmic shear sensitivity to different cosmological param- 
eters. From this figure, it is clear that cosmic shear is 
more sensitive to i7 m , as and the source redshift, with the 
cosmic shear signal increasing with the increase of these 
parameters. It also shows that the dependence on n s is a 
stronger function of scale than for the other parameters. 
This is in agreement with theoretical expectations derived 
from linear perturbation theory ( fBernardeau et al. 19 97): 

/ n \ -(n s +2)/2 

( 7 W /2 «^ 75 4 8 • (12) 

The measurement of the signal down to small (non-linear) 
scales introduces additional parameter sensitivity with 
some degeneracies broken, like for instance the parame- 
ter pair (Sl m , as), as shown in |jain fc Seljak 1997| This 
gain in sensibility is also noticeable in the a s case. 

3. Markov chain Monte Carlo 

3.1. General considerations 

The probability distribution function (PDF) of an Tri- 
dimensional vector parameter p given the n-dimensional 
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vector data x(p ) (the posterior PDF P(p|x)), can be 
calculated using Bayes theorem from the prior PDF P(p) 
and the conditional PDF of the data given the parameter 
vector (the likelihood L(x|p)) : 



P(p|x) = 



P(p)L(x|p) 
/P(p)L(x|p)dp- 



(13) 



Its analytical computation would involve high- 
dimensional integrations, not only to compute the nor- 
malisation (also called the evidence or the marginalised 
posterior) but also to extract information from the poste- 
rior, such as the determination of means or marginalised 
lower dimensional distributions. Thus, Eq. (|13f) is analyt- 
ically solved only for special cases: For Gaussian PDFs, 
with mean given by po, we see from Eq. (|13H that the 
posterior inverse covariance matrix is Cy = 
where F is the Fisher information matrix, 

, d 2 C \ ( d 2 C 



Fj, 



r^prtor 
' ij ' 



dpidpj 



dpidpj 



(14) 



P = PO 



In general, for non-gaussian PDFs, this Fisher matrix 
method still gives valuable information, since we can al- 
ways Taylor-expand C = — In L around the point of max- 
imum likelihood, p — po, obtaining, to quadratic order, 



A£(p) = (p-po) t F(p-p ) 



(15) 



and use the Fisher matrix as a linear approximation to 
the inverse covariance matrix. Being a covariance ma- 
trix of a real physical problem, F^ 1 must be positive- 
definite. Therefore an hypersurface of constant A£(p), as 
defined by Eq. I|15l) . is an hyperellipse which will be an 
approximation of a certain a volume of the posterior. The 
one-dimensional parameters marginalised errors, i.e., inte- 
grated over the other components of the parameters vec- 
tor, are given by Api(la) = {F^ 1 ) 1 / 2 . From Eq. (jT5|) it 
is clear that Appier ) = {Fu)^ 1 / 2 is the error on pi when 
all parameters but pi are fixed at po- Note that, since the 
Fisher matrix is exactly the quantity involved in the Rao- 
Cramer inequality, the errors computed from the Fisher 
matrix approximation are always lower limits of the true 
ones. 

In practice, in order to obtain a more precise result, the 
problem is usually solved by computing the posterior at 
optimized sample points that pave the parameter space. 
The traditional approach uses a regular grid. This is a 
computer-intensive procedure with computation time ris- 
ing exponentially with the space dimension, which limits 
the number of parameters that can be explored. Markov 
chain Monte Carlo sampling IjGilks et al. 1996jl overcomes 
this limitation. 

The use of the MCMC technique in cosmo- 
logical parameters estimation was first implemented 
in [Christcn sen et al. 20011 following the proposal of 
Chri stensen fc Meyer 2001] Current tools like CosmoMC 
<jL^wTs"7Tl3rKU f e"2002f) nolonger evaluate the likelihood at 
fixed points but at selected positions of a Markov chain. 
Each chain point, Pi+\, is derived from the previous chain 



point, in such a way that the transition probability 
from pi to p i+ i times the posterior PDF of p i: equals the 
product of the transition probability frompi + i to pi by the 
posterior PDF of Pi+i- Thus, after a relaxation time, the 
chain reaches the equilibrium and constitutes a sample of 
the posterior. A clear advantage of this is that statistical 
properties of the distribution, like the mean of a parame- 
ter or a marginalised confidence interval, can be directly 
derived from discreet sample points, without need to use 
the computed values of the likelihood. Different priors may 
also be introduced adapting the weighting scheme defined 
by the sample, without need to build a new chain. 

The computing time is determined by the number of 
points needed to converge to the equilibrium distribu- 
tion. If the chain is built in an efficient way, CPU time 
scales linearly with the dimension of the parameter space. 
Computing time may be reduced by finding an analyti- 
cal expression for the posterior. For example, the Markov 
chain data is used to fit the log-likelihood with a polyno- 
mial in lSandvik et al.* 2003 

3.2. MCMC Analysis 

The MCMC code we developed is based on the Metropolis- 
Hastings algorithm ( [Metropolis et al. 1953| and 
|Hastings 1970|), like CosmoM C (jLewis fc Bridle 2002j) . 
Cog ( Slozar & Hobson 2003|l or the AnalyzeThis 
l|Doran fc Miiller 2003(1 public software. 

3.2.1. Chain progression rules 

We start several chains at different initial positions cho- 
sen randomly inside the limited part of the 7-dimensional 
parameter space we aim to explore (Table OJ- 

Table 3. Parameters and exploration range investigated 
by the MCMC chain : The upper part of the table shows 
the 7 parameters used in the initial proposal density, along 
with their exploration range. The bottom part shows ex- 
tra imposed limits to the MCMC exploration. Other in- 
dependent cosmological parameters arc kept constant at 
their fiducial values. 



LUb 




0.01 


0.04 






0.01 


0.3 


h 




0.4 


1.0 


n s 




0.5 


1.4 






0.3 


0.9 






-0.2 


0.2 






0.6 


1.2 


n K 




-0.1 


0.1 


Age 


(Gyr) 


10 


20 




< 


3fi b 





The next point is proposed using a proposal PDF, 
q{pi+i\Pi) and the unnormalised posteriors of both points 
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are compared to decide if the new point is acceptable. The 
PDF acceptance rule we use for the next step point Pi+\ 
is defined by 

/ I \ ■ ( p{Pi+i)L{x\p i+1 )q(p l \p i+1 ) \ 

a(Pi+i \Pi) = mm i (16) 

I P{pi)L{x\Pi)q{pi+i\Pi) J 

a is compared to a random number u generated from a 
uniform distribution between and 1. If a > u, then the 
new Pi+i is accepted; otherwise it is rejected and we keep 
the point Pi+i — Pi- Since both the proposal and the ac- 
ceptance densities only depend on the current element of 
the chain and not on its previous history, the resulting 
sample will be a Markov chain. It is worth noticing that 
the ratio in this definition makes the normalisation of the 
posterior PDF unnecessary. Furthermore, the presence of 
the proposal PDF in Eq. (flfjf) . ensures that p(p)L(x\p) 
is the equilibrium function, even in the case of a non- 
symmetrical q, i.e. when the probability of proposing p 
from p' is different from that of proposing p' from p. 

The result is independent of the proposal density, q. We 
use as q, at the beginning of the chain, a 2-dimcnsional 
Gaussian distribution centered at the current chain ele- 
ment. Hence, only 2 of the 7 parameters (randomly cho- 
sen) change at each step. The covariance matrix of q is 
chosen to be of the order of the expected, squared, la er- 
ror bars. The la error value is used as the step definition 
criterion and guarantees the step amplitude has an ade- 
quate size. Would it be too small, the chain would move 
too slowly and could never leave the vicinity of the best 
fit. This situation is known as poor mixing and leads to 
underestimated confidence limits. In contrast, if the pro- 
posed steps are too large, the acceptance rate will be too 
small and once again the chain will move slowly. 

In order to have an adequate initial proposal den- 
sity we derived approximate la errors from a Fisher ma- 
trix computation. Applying Eq. (fH|l to Eq. JSJ) leads to 
dTegmark et al. 1997| ), 



F ■ — 



2. 



/3=1 



°a/3 



( d(j 2 )a d(l 2 )l3 

\ dpi dpj 



(17) 



where the derivatives of the shear variance are evaluated at 
the fiducial model. To numerically evaluate these deriva- 
tives, we compute the shear variances at points p{ ld ± Ap,, 
with the deltas ranging from 0.1% to 10% depending on 
the sensitivity of the shear statistic to each cosmologi- 
cal parameter, pt. However, computing and sampling er- 
rors, coming mainly from the near cancellation of cer- 
tain combinations of derivatives, as was pointed out by 
Einscnte in et al. 19981 generate small fluctuations in the 
Fisher matrix coefficients that are amplified by matrix in- 
version when the covariance matrix is derived from the 
Fisher matrix. For this reason, we did not use the inverse 
Fisher matrix as the covariance matrix of the initial pro- 
posal density, but used instead a diagonal covariance ma- 
trix whose ratios between coefficients are equal to the ra- 
tios between the diagonal coefficients of the inverse Fisher 
matrix. It is worth noticing that one way to obtain a not 



so nearly singular Fisher matrix is to include the deriva- 
tives of the data covariance matrix in the derivation of the 
Fisher matrix formula (Eq. Ijl7|l ). 

In order to better explore the directions of degener- 
acy and consequently speed up the convergence, a non- 
diagonal proposal covariance matrix is needed. Hence, 
after 1000 steps, the covariance matrix of the chain in 
progress is computed. From it, a new set of 7 parameters, 
aligned with the eigenvectors of the evaluated correlation 
matrix, is defined. From that step on, the new sample 
points are built from a combination of 2 eigenvector di- 
rections, randomly chosen at each step. Though it defines 
the next direction, the step size does not necessarily needs 
to match the corresponding eigenvalues. In fact, after 1000 
steps the covariance matrix is smaller than it will be at its 
converged value, so we must scale it. We update periodi- 
cally the proposal covariance matrix. However, since this 
process computes a new sample covariance matrix, it can- 
not be done too frequently; otherwise the progression of 
the chain too much depends on the previous elements and 
would no longer be a Markov chain. After a few periods 
we freeze the proposal density and set the multiplicative 
correction factor to 1. The optimal scaling value depends 
on the number of dimensions probed fGelmari ~1996). 

3.2.2. Convergence and goodness 

The first elements of a chain depend on the starting 
point. It is only after a so called burn- in period that 
the chain starts sampling the target density. Although it 
is not possible to say with certainty that a finite sam- 
ple from an MCMC algorithm is representative of the 
target distribution, several tests have been proposed to 
detect non-convergence of the chain (for a review see 
ICowles fc Carlin 1994jl . In this work, we used the Gclman 
and Rubin test to estimate the size of the burn-in. 

Let us consider a Markov chain of a given parameter 
composed of 2n iterations. Due to the random selection 
process of the starting point we expect m different chains 
to differ significantly at the beginning and to converge to- 
wards the same distribution as the number of iteration 
steps increases. The burn-in interval is set when the typi- 
cal separation of several chain points at a given iteration 
is similar to the amplitude of the chain internal fluctua- 
tions. At each iteration i, two quantities can be computed 
for each parameter: 

— The within-chain variance, W(p), is the average of the 
m variances of the parameter, 

2» 



1 1 _ 

w (p) = ™E — E^w- <p>i ij)f- ( 18 ) 

j—l i—n 



— The between- chains variance, B(p), is the variance of 
the parameter means from the m chains, 

- m 

B(p) = - £(<?>* (.?)- « P >* 0') >.) 2 -(i9) 

m — 1 * — ' 



3=1 
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Fig. 2. Monitoring a chain. The upper plot shows the suc- 
cessive values of as for two of the CFHTLS simulated 
chains with different starting points. In the middle plot, 
the Gelman and Rubin test between these 2 chains is made 
at several stages. This example shows that it is possible to 
detect an apparent convergence around the iteration step 
20 x 10 3 . From the iteration 60 x 10 3 on, R started to 
consistently approach 1 for all parameters, leading us to 
choose a burn-in of 30 x 10 3 . The horizontal line is R=1.3. 
In the bottom plot, the acceptance rate of one chain is 
computed at each iteration i, using all chain elements from 
the first one up to i. In the very beginning we notice two 
opposite behaviors : a shear drop and a steady raise. The 
transition takes place at the moment the proposal den- 
sity starts to follow the covariance matrix of the sample, 
showing the efficiency of that procedure. Afterwards, the 
acceptance rate drops slowly stabilizing at about 20%. 

In the first iterations, each chain is concentrated in a dif- 
ferent starting region, so B is in most cases much larger 
than W . When the iterations increase, W grows while B 
decreases and get closer and closer to zero at convergence. 
Since it is not possible to explore the whole space, W 
is always an underestimate of the within-chain variance. 
Hence, B + W is an overestimate of the parameter vari- 
ance. In the limit i — ► oo, both estimators approach the 
true variance from opposite sides. Their ratio, 



R 



1 

Tr 



B 



Wn 
n-l 



(20) 



is therefore a suitable estimate to monitor the conver- 
gence. 

When, after 2n iterations, R is close to one for all the 
quantities of interest, i.e., for all the parameters and de- 
rived parameters we want to analyse, we assume the chain 
has converged. The first n iterations are the burn-in pe- 
riod. They are discarded and the actual marginalised pos- 
terior density of a parameter is drawn by its frequency of 



appearance in each bin, during the iterations n + 1 to In. 
When after 2n iterations there was not enough time to 
explore the tails of the distribution the errors of the tar- 
get distribution are underestimated. Therefore, it is useful 
to let the chains run for a longer period in order to get 
a better mixing. In the process, the value of the estimate 
R may raise before getting smaller again. We show an ex- 
ample of this situation in Fig. [3 where we follow a chain 
evolution. 

Once a chain has stopped, we must remove the resid- 
ual correlation between the consecutive elements. For 
this reason it is recommended to thin the chain out, 
i.e., to keep only 1 out of k consecutive elements. The 
most widespread method in the literature to determine 
the thinning factor, k, is the Raftery and Lewis method 
(RL)( |Raftery fc Lewis 19961 ) ■ This method starts by con- 



12 3 

thinning flog) 

Fig. 3. The standard deviation of as computed from dif- 
ferent samples taken from the same chain by using dif- 
ferent thinning factors is plotted against the thinning in 
logarithmic scale. 

structing several chains from the converged MCMC chain, 
by thinning the latter with several different values. A 
weight may be assigned to each one of the thinned chains, 
according to its compatibility to an independence chain 
(a chain with no correlation between its consecutive ele- 
ments). RL computes the weight of a chain from the ra- 
tio between its evidence and the evidence of an indepen- 
dence chain. The evidences are computed in the Bayesian 
Information Criterium (BIC) form of Schwa rz 19781 which 
is a gaussian approximation that may be derived from 
the Bayes formula (Eq. jT3J|). It writes, ratio = BIC = 
G 2 — 2 log n, where n is the number of elements in the 
chain. The G 2 statistic is a x 2 that measures the fit of a 
chain to an independence chain. To obtain G 2 , one counts 
the number of transitions between bins of the chain, in 
order to get the ratio between the probability of the chain 
to have a certain value at a certain step i for a given chain 
value of a previous step i — j, and the probability indepen- 
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dently of the value at i — j. In practice, when counting the 
transitions, only 2 bins are assumed, z.e, a chain element 
becomes a or a 1 as whether its parameters values are less 
or greater than a certain cut-off, that we choose to be the 
parameters 2a values. The greatest weight is attributed to 
the longest chain verifying BIC < 0. Its thinning value is 
the obtained k factor and that chain is the best-fit to an 
independence chain that can be obtained by thinning the 
original chain. 

It is worth noting that this is an application of the 
use of the evidence to the problem of selection between 
different sets. Another evidence based weighting scheme 
is used when combining different sets of cosmological data 
in lHobson et al. 20021 

There are other methods to estimate a thinning factor. 
In particular, |Tegmark et al. 2003| obtain it by defining a 
more intuitive chain correlation length. Using RL, we ob- 
tained very large thinning factors for some of the MCMC 
chains (of order 100). However, we checked the dependence 
of the results on the thinning factor, by computing the 
parameters confidence levels using several chains, thinned 
from the original chain with different values of k and found 
no appreciable difference in the results (see Fig. for an 
example). Hence, in order not to lose so many chain ele- 
ments we did not use the Raftery and Lewis method re- 
sults, but a simpler criterium: for each chain we choose the 
thinning factor as the average multiplicity of the chain el- 
ements. 

The results in the next section are computed from 4 
chains with 10 5 elements each, with a burn-in size of 30 x 
10 3 and a thinning factor of 5. The chains were merged, 
leading to a final sample with about 50 000 elements. 



4. Cosmological parameters from CFHTLS cosmic 
shear statistics only 

We will now extract information about the cosmologi- 
cal parameters from the obtained sample of the posterior 
PDF. 

We start by computing one-dimensional confidence 
levels for the parameters. Table 01 (I) shows the standard 
deviations obtained for a set of 8 parameters; the 7 original 
ones and Cl m = (u>b + uj c )/h 2 . These values are computed 
using all the sample points (hence being marginalised val- 
ues) and are shown in absolute value in the first line of 
Table 0Jl) and in percentage of the parameters fiducial 
values on the second line. Since MCMC probes a non- 
gaussian posterior, asymmetric error intervals may also 
be computed. We found the positive and negative 68% 
confidence levels do not differ much from the standard 
deviations and do not show them here. As compared to 
the early ViRMOS-Descart results, the CFHTLS configu- 
ration does not seem to increase the precision on a$. This 
is however misleading since lVan Waerbeke et al. 2002l car- 
ried out their maximum likelihood analysis using only 4 
parameters (og , Q m , T — Q m h , z s ). Hence the actual im- 
provement is eventually much better. 



One-dimensional confidence levels do not show the 
detailed statistical structure of the cosmological param- 
eter space. In the following we describe the interest 
in using a principal components analysis in the cos- 
mological parameter space, a technique pioneered in 
lEfstathiou fc Bond 19991 

4.1. Principal components analysis 

The principal components of the sample, Xi, are derived 
from the eigenvectors of the sample correlation matrix. 
The correlation matrix is the covariance matrix of the 
sample of parameters in standardized form, which means 
each parameter value is rescaled by subtracting the mean 
and dividing by the dispersion. The principal components 
(PCS) can be expressed as a linear combination of the 7 
rescaled parameters as follows: 



7 

3 = 1 



iPj-<Pj >) 



(21) 



where < Pj > and a P j are the mean and the dispersion of 
the parameter pj. The coefficients are listed in Table 0] 
(II), where the principal components are ordered by de- 
creasing accuracy. Figure 01 shows some examples of 2- 
dimcnsional plots of the parameters defined by Eq. (|21|) . 
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Fig. 4. 1 and 2a contours for pairs of principal compo- 
nents. Since they are linear combinations of standardized 
variables, they are all centered on zero. It is obvious these 
are plots of non-correlated parameters. The low scattering 
in the X\ direction indicates that the cosmological param- 
eters combination defined by X\ is strongly constrained. 
On the other hand, the high scattering in the principal 
components associated with the highest eigenvalues, indi- 
cates these components contain most of the sample disper- 
sion, and determine the cosmological parameters errors. 

The most accurate PCS are the best determined quan- 
tities by the CFHTLS-wide cosmic shear experiment. In 
order to see to which combinations of cosmological param- 
eters they correspond, one can look at the eigenvectors 
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Table 4. Numerical results for the cosmic shear sample, including la precision on individual parameters and a principal 
components analysis. I: 68% confidence levels in absolute value (first line) and in percentage of the corresponding 
parameter fiducial value (second line). II: The 7 eigenvectors of the correlation matrix ordered by decreasing accuracy. 
The column named lcr, lists the dispersion of each X parameter defined in Eq. 1|21|) . which is equivalent to the square 
root of the corresponding eigenvalue. Each line i shows the coefficients ay of Eq. Q21[l. i.e., the projections of the 
corresponding on each of the 7 parameters pj labeled on the very top of the table. Naturally the derived parameter 
TO is not used for the computation of the eigendirections and J2j=i a lj = 1- The 7 eigenvectors computed for 
mean subtracted data normalized by the means. The parameters used are (fif,, fl m , h, n s , fl v , a s , erg). The column lcr, 
lists the dispersion of each Y parameter defined in Eq. i|22|) . The next 7 columns show the components, 6y, of the 
Yi. In the last column we show the relative contribution of the main parameter involved in each pc. IV: Each line 
shows the fractional error of each of the 7 parameters (fib, £l m , h,n s ,fl v ,a s ,as) computed using a limited number of 
principal components. The first line, #pc = 1, refers to using only the projections of Yj. In = 2 both Yj and Y@ 
are used. Using all the PCS, we recover, in the last line, 100% of the error values for all parameters. 
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-0.031 
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0.176 


46 




Y 6 
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-0.235 


0.041 


-0.044 


0.055 


-0.332 


0.102 


82 




Y 7 
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-0.038 


-0.025 


0.080 


0.007 


-0.932 


0.176 
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100 


100 


100 


100 


100 





components, i.e., a high coefficient means pj strongly 
contributes to Xi. However, since we are working with 
standardized parameters, the direct reading of the coeffi- 
cients may be misleading. In fact, if we take any subset of 
2 parameters and compute its eigenvectors, they both will 
have equal components, which obviously does not mean 
each principal component has equal contributions from 
each parameter. Hence, for the purpose of obtaining a set 
of meaningful coefficients , it is adequate to rescale the pa- 
rameters differently. We rescale them by subtracting the 
mean and dividing by the mean. This is refered to as the 
fractional data in lChu et al. 20031 The covariance matrix 
of the rescaled sample relates to the original covariance 
matrix as, = Cy/(< pi ><Pj >)• 



We compute a set of principal components, Y , from 
the fractional covariance matrix, expressed as, 



Y i = ±b ij ^- <P ^ >) . (22) 
H < Pi > 



Each Yi approximately corresponds to one Xi but they are 
not equivalent since the principal components depend on 
the scaling of the variables. We show this set in Table 0] 
(III), computed for a slightly different set of cosmological 
parameters with (wf,,w c ) replaced by (il{,,f2 m ). 
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Fig. 5. 2-dimensional scatter plots colored by a third parameter putting in evidence the multi-dimensionality of the 
degeneracies. Panels are numbered from top left to bottom right. Panels 1-3: Illustrate the best determined pc in 
the form lo c — <7g — h. The well defined continuous gradients in all 3 cases show the existence of a degeneracy, even 
though that would not be obvious from the 2-dimcnsional projections alone. Panel 4: Using O m instead of ui c , the best 
determined pc already defines a narrow degeneracy in the 2-dimensional plane (fl m ,ag). Q v values are also shown. 
Panel 5: Coloring the (Q„, f2 m ) plot with the curvature values, a linear pattern appears. Panel 6: Shows the degeneracy 
u s — n s which was not clear from the eigenvectors table alone. By coloring with h it also shows that this degeneracy 
contributes to Yz- Panels 7,8: The second best determined pc relates mainly n s and h. These plots show it also has 
some correlation with a s (panel 7) and less from <7g (panel 8). 



4.1.1. Well constrained PCS 

The components of an eigenvector explicitly show the con- 
tribution of each parameter to a principal component: 

— The first one, Y\, is dominated by contributions from 
as and Q m . This is the well known erg — Sl m degener- 
acy. £l v is also non-negligible. Its presence here comes 
from its correlation with f2 m through the prior on the 
curvature used to compute the chains (see Table EJ). 

— The second best constrained direction, Y2 , couples the 
primordial spectrum index, n s , with h and O m . This 
comes from the correlation between the tilt n s and 
the parameterisation of the slope of the mass power 
spectrum to which the shear is sensitive, T = fi m h. 
Notice the coefficients of n s and h have the same sign, 



as is also the case with ag, and tt m in Y\, which shows 
the orientation of the degeneracy. 

The relative contribution of the main parameter involved 
in each principal component is in the last column of Table 
El (III). For example, the projection of Y\ on the direc- 
tion of its main parameter (erg) is 78%. Although these 
fractions may be important, they are significantly below 
100%, showing that none of these principal components 
that describe the cosmic shear spectrum depends on a 
single cosmological parameter. 

The 2-dimensional projections of Fig. [S] use colors to 
produce 3-dimcnsional plots that better describe sensitiv- 
ity of cosmic shear to cosmological parameters. The com- 
parison of Tabled (II) and Tabled (III) shows for example 
that the erg — TO degeneracy is equivalent to a u> c — h — erg 
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degeneracy. A scatter colored plot, like the top panels of 
Fig. |5j illustrates this in a simple way. On this figure, we 
plot the sample points for the 3 possible pairs of parame- 
ters, colored by the one left out. The continuous gradient 
along the third parameter is obvious. In particular, the 
degeneracy that is hidden in the erg — tu c plane becomes ev- 
ident once the points are colored according to h. Likewise, 
the cosmic shear erg — f2 ro degeneracy pattern erg cx ^m°' 5 
is shown on the fourth panel, with a continuous gradi- 
ent along n v . The fifth panel shows how this correlation 
between J7 m and fi„ is related to the curvature. 

The color plots also help in understanding degeneracies 
derived from the analysis of Table 0| The bottom panels 
of Fig. El illustrate the cases on the Y2 term either when 
a third parameter does not contribute to the degeneracy 
(erg) (producing a mixture of colors), or when it does (cv s ). 
The analysis of Table 0] alone is indeed confusing: while 
from 01 (III) it is not evident that a s contributes more 
to the second principal component than erg does, Table 
01 (II) shows what really happens. The need for a care- 
ful interpretation of this table is of primarily importance 
when fractional covariance matrix is used for parameters 
with fiducial values close to zero. The simple extraction 
of eigenvectors components to find degeneracies only pro- 
vides qualitative insight since there is no unique set of 
principal components. For these ambiguous cases, color 
plots are very useful. The bottom right panels of Fig. [SJ 
reveal the sensitivity of Y2 to a s in a much better way 
than the tables do. 

From the MCMC and the principal components anal- 
ysis, it is possible to describe some cosmic shear denegera- 
cies with empirical laws, using the best determined com- 
ponents Y\ and Y^. Since the 2-dimensional contours of 
Y\ and Y~2 are not ellipses, we made a new eigenvector 
calculation, using the logarithm of the parameters. The 
laws are established for two parameters only, marginalis- 
ing over the others. This way we found the shapes in the 
erg — Q m and the n s — h planes to be: 

erg Q Q m 52 = 0.467 ±0.008 (23) 

and hn° s 57 = 0.67 ± 0.10. Defining T = D, m h, we find a 
better constrained, Y2 related, two parameter relation, 

Tn° s e = 0.187 ±0.037. (24) 

4.1.2. Poorly constrained PCS 

As for the other principal components: 

— Y3 is dominated by fl v , f2 m and erg . This constrains the 
curvature, mixed with an Y\ orthogonal contribution 
of erg — fl m , as the coefficients of these 2 parameters 
have now opposite signs. 

— The projection of I4 on the n s — h plane constrains an 
orthogonal direction to the Y% projection. 

— Y5 mixes almost all the parameters, while being dom- 
inated by an fl m — fl v plane orthogonal direction to 
the one defining the curvature. As we will see next, Y5 



alone is almost enough to determine the parameters 
precision. 

— The two worst constrained principal directions, Yg and 
I7, are the most aligned ones with single parameters, 
being strongly dominated by fib and a s , respectively. 

Even though the best determined PCS give strong 
constraints on certain combinations of parameters, 
constraints on individual cosmological parameters are 
strongly dependent on the worst determined PCS, excep- 
tion made for a cosmological parameter aligned with a well 
constrained principal component (from the last column of 
Table 0] (III) we see there is no such case). In fact, the 
worst constrained PCS determine the size of the hyper- 
volumc defined by the sample in parameter space and ac- 
count for most of the dispersion of the sample. This is the 
reason why f2f, and a s , which dominate Yq and I7, are 
the worst determined parameters (Table 0] (I)). To inves- 
tigate the contribution of each principal component to the 
one-dimensional errors on parameters, we compute the lcr 
marginalised parameters dispersion using only a restricted 
number, n, of principal components, as: 

8-n 

er 2 (p J ,n)=<^ 2 >^(y i 6y) 2 . (25) 

1=7 

Notice the case n = 1 uses only Yj. Notice also the 
need to multiply by the parameters means, since we 
are using fractional data. Table 0] (IV) shows the lcr 
marginalised errors for each parameter as a percentage 
of the total lcr marginalised errors (shown in Table Ell)), 
computed in this way, where #pc=i refers to n = i of 
Eq. (EJ). We see that Y 7 + Y 6 + Y 5 (#pc=3) accounts 
for over 95% of several parameters lcr values and from 
Y<? + Yg + Y5 + I4 + I3 all the correct values are obtained. 
Hence, we may conclude that the statistical information of 
the sample can be reduced to 5 dimensions plus 2 narrow 
priors defined by Y\ and Y2. This way, a 5 dimensional 
MCMC would be enough to obtain an equivalent sample 
in a faster way. 

Finally, the cosmic shear dependence on the cosmolog- 
ical parameters may be summarized in a single expression 
analogous to Eq. (|12fl . It is derived by using the surface 
of least dispersion defined on the 7-dimensions surface. 
It must be orthogonal to Y\, i.e., defined by Yi = c te . 
This constant is proportional to the cosmic shear variance 
signal from all scales (since they were all integrated to 
compute the likelihood, it does not explicitly depend on 
angular scale). We found that 

7 oc erg f4 57 n° b 007 n v - 18 h-°- 02 n~ om a; a002 . (26) 

The MCMC and principal component analysis of 
CFHTLS cosmic shear data alone can be generalised with 
joint data sets. As it has been shown in this section, the 
method provides useful information on degeneracies and 
principal components and allows a description of orthog- 
onal directions. Used jointly with CMB data sets, we can 
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Table 5. Onc-dimensional results from the joint sample. 
The errors are la. Above the horizontal line are the 7 
explicitly changed MCMC parameters, while results for 
some other popular parameters are shown under the line. 
The column labeled g\ shows the gain in the parameters 
precision in relation to the values obtained with the CMB 
chains. In the last column, the gain gi is computed in 
relation to available CMB results (taken from Table 8 of 
ISpergel et al. 20031 for the case WMAPext). 
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therefore predict how CFHTLS cosmic shear and CMB 
data can be used in an optimal way to shrink the exclu- 
sion diagrams attached to each cosmological parameters. 

5. Constraints from Cosmic Shear and CMB 

We produced a different set of chains, computing the joint 
likelihood of the models with respect to the same cosmic 
shear fiducial data and CMB data. 

We used the WMAP first year data 3 : the combined TT 
power spectrum (Hinshaw et al. 2003) and the TE power 
spectrum ( |Kogut et al. 20031 ) ■ Model's likelihoods with re- 
spect to WMAP data were computed using the WMAP 
likelihood code IjVerde et al. 20Q3|) . In order to have infor- 
mation from smaller scales, we included CBI data 4 : the 
mosaic odd binning (Pear son et al. 2003 ) . The number of 
independent parameters explored by the Markov chains 
was kept at 7: (lui, ,u> c ,h,n s ,a s ,r ,A S ). The normaliza- 
tion is now parameterized by A s , with a fiducial value of 
A s = 2.6 x 10~ 9 , corresponding in our fiducial model to 
o"8 = 0.9. We restrict to flat models, hence Sl v is no longer 
an independent parameter and let r, the optical depth to 
reionization, change in the restricted region of r < 0.3., 
keeping its fiducial value at t = 0.17. 

We present results from a combination of 8 converged 
chains about 70 x 10 3 elements long from which we rejected 
the first 30 x 10 3 elements. The thinning factor is 8, leaving 
us with a final merged sample of about 40 000 elements. 

3 http://lambda.gsfc.nasa.gov 

4 http:/ /www. astro. caltech.edu/~tjp/CBI/ 
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Fig. 6. Marginalised distributions for the cosmological pa- 
rameters from the joint likelihood chains (solid line) and 
the CMB chains (dashed line). 



Table CD shows one-dimensional marginalised results 
from the joint sample. In order to explicitly see what can 
be gained when joining cosmic shear data to CMB data, 
we also produced CMB only chains. One-dimensional dis- 
tributions from both CMB and joint samples are shown in 
Fig. The ratio between the parameters standard devia- 
tions obtained with the CMB sample and the joint sample, 
g = a(CMB)/a(CFHTLS + CMB), tells us to which 
parameters the combined analysis is more efficient. These 
values are shown in column g\ of Table El As a consis- 
tency check we show in column gi the factor gained by 
the joint sample when compared with the most appropri- 
ate case of the published WMAP results. The largest gain 
is on the cluster abundance scaling, erg f2°„ 5 . As we saw, 
this parameter is roughly the first principal component of 
the cosmic shear and its error is well determined by cosmic 
shear alone. So, what makes more sense here is to compute 
the gain with respect to the cosmic shear sample and not 
to the CMB sample. In this case we find the joint sample 
brings no gain (gi = 1.1). 

Keeping in mind that the combined result of 2 inde- 
pendent experiments with errors of the same order has al- 
ready a gain of y/2, we consider the combined analysis to 
be efficient for a certain parameter if g > \/2. Hence, the 
efficiency is higher for the dark matter density, the nub- 
ble parameter, erg and the spectral indexes. To illustrate 
this result, we plot in Fig. the pairs of parameters where 
the orthogonality between CMB and CFHTLS contours 
is most striking, among all possible pairs. These are con- 
tours of equal likelihood, containing 68% and 95% of the 
sample. All the 4 cases involve only the efficient parame- 
ters. Furthermore, they correspond to cosmic shear Y\ and 
Y2 related well constrained cases we found in the previous 
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Fig. 7. Marginalised 2-dimensional 68% and 95% contours from the three samples. These are the most relevant plots 
to illustrate the largest gains on the parameters precisions. 
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Fig. 8. Primordial power spectrum parameterized by n s and a s . The values written in the panel titles define the 
dotted lines, while, in each panel, the solid lines are plotted with the fiducial values (n s — 0.93 a s — —0.04). The scale 
ranges probed by the CMB and the cosmic shear are shown in the upper left panel. 
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section. Thus, we found that projections of the best con- 
strained cosmic shear principal components are orthogo- 
nal to the corresponding CMB contours, which shows a 
complementarity between cosmic shear and CMB. 

To understand the origin of some of this complemen- 
tarity, let us consider the n s /a s case. The gain on both 
parameters is around 2, even though, as we saw, the cosmic 
shear by itself is not very sensitive to the running spectral 
index. In Fig. [5] we plot the primordial power spectrum 
from Eq. JSJ as function of the linear wavenumber. The 
spectral indexes parameterize the shape of the spectrum 
and have no further role on its evolution. Thus the op- 
posite behaviour of the CMB/cosmic shear responses to 
a change on the indexes may be understood from these 
plots. The solid line in all panels is the fiducial model 
(n s = 0.93 , a s = —0.04). It bends away from a power law 
(a s = 0, the dashed line on the upper left panel) from the 
pivot point. The dotted lines are deviations from the fidu- 
cial model, they correspond to the indexes values written 
as the panels titles. In the top panels one parameter is 
changed at a time. On the left, a change on n s produces 
changes of opposite signs at both ends of the spectrum. On 
the right, changing a s raises both ends of the spectrum. 
The bottom panels show how it is possible to mimic the 
fiducial spectrum for large (small) scales by changing both 
indexes in the same (opposite) direction. 

On the first panel, the solid horizontal lines show the 
scale ranges probed by the CMB (the line on the left) 
and the cosmic shear (on the right) data used in this 
work. These intervals were found by using the calculations 
of Tcgm aTk fc Zaldarr iaga 2002, in particular their fitting 
k « V / ( 3 7 6 [h/Mpc]. Hence, the 2 bottom plots lead to 
expect an upper left - lower right direction of degeneracy 
for the cosmic shear and an orthogonal one for the CMB. 

The shape of the n s /h lensing degeneracy has a simi- 
lar origin. The slope of the power spectrum at the scales 
probed by the cosmic shear is tt m h. A raise of h, increases 
the power at small scales that must be compensated by a 
decrease in n s . 

In order to have an explicitly view of the cosmic shear 
n s /a s degeneracy scale dependence, we produced a new 
set of 4 cosmic shear MCMC chains. This time, we only 
allowed the scalar spectral indexes to change and kept 
the other 5 of the 7 original parameters (see Table EJ) at 
the fiducial values. The chains were built following the 
procedure detailed in section 3. Due to the small number 
of parameters probed, convergence was very rapid and a 
burn-in of 300 elements was enough to reach the equi- 
librium. The models shear dispersion and likelihood were 
computed for 4 cases, distinct on the angular scales used. 
These are (in arc minutes): 

- "small" scales: (0.6,1,1.3,2,3,4,5,6) 

- "medium" scales: (8,10,15,20,25,30) 

- "large" scales: (40,50,60,80,100,120) 

- "all": all the above 20 angular scale points, which 
are the ones used in the 7-dimensional MCMC cosmic 
shear chains. 



Figure |5| plots the sample points and contours obtained. It 
shows that even inside the comparatively small region of 
Fig. |H| probed by the cosmic shear, the scale dependency 
of the n s /a s degeneracy constraint is detected, with the 
best constraint coming from the smallest scales. The sig- 
nal from the largest of the cosmic shear scales is more 
dispersed as it is shown at the bottom left panel of Fig. |5| 
Notice, in this case, the orientation of the contours is cut 
by the MCMC exploration limits imposed. 



small 



medium 




large 





Fig. 9. n s a s MCMC sample points and 68% and 95% 
confidence levels. The models likelihoods were evaluated 
using 4 different sets of angular scales - small, medium, 
large and all, denned in the text. The "small" case is the 
one that better constrains the n s — a s degeneracy found 
in the 7-dimensional analysis. 

There are parameters for which no gain was found, 
for instance, the measurement of u>b is dominated by the 
CMB. For r, even though the cosmic shear is not sensitive 
to it, the introduction of cosmic shear data strengthens the 
(jg — t correlation allowing to lower the errors on og e~ 2r 
(Table |SJ. Thus, even though we do not predict a gain on 
the measurement of r from CFHTLS+CMB data, future 
cosmic shear surveys, through a more precise measure of 
<t 8 , will be helpful in its determination. In Fig. ^| its is 
shown the correlation between erg and e _2T , the factor 
by which the CMB at small scales is damped after reion- 
ization. The information provided by the cosmic shear is 
clear. 

6. Discussion 

We studied the determination of cosmological parameters 
by a CFHTLS-Wide type of experiment. For this we have 
made some assumptions that may not exactly match the 
real situation. First of all, the real survey features will not 
be identical to the used ones. In particular, we made the 
optimistic assumption that all the 170 deg 2 of the survey 
area will be usable, whereas a size of 130 deg 2 would be 



Tereno et al: Joint cosmological parameters forecast from CFHTLS-cosmic shear and CMB data 



15 



0.80 
0.70 

0.60 
0.50 




0.80 
0.70 

0.60 
0.50 



Table 6. Cosmic shear : Wide field space telescope illus- 
tration specifications. 
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Fig. 10. Marginalised 2-dimensional 68% and 95% con- 
tours. The first plot is from the CMB sample and the sec- 
ond one from the joint sample. 

more realistic 5 . Furthermore, since we assumed the red- 
shift of the sources perfectly known, there will be an ex- 
tra source of error coming from the marginalisation over 
the real sources redshift distribution. The same happens 
with the marginalisation over other cosmological param- 
eters not taken into account in this study, such as the 
equation of state of dark energy or the neutrinos den- 
sity. The calculations into deep non-linear regime using 
the halofit formula may also bring an extra source of 
errors. 

We need also to keep in mind that we made a 
systematics-free study, with data errors being dominated 
by Poisson noise and cosmic variance. B-mode contami- 
nation due to systematic residuals could significantly in- 
crease uncertainty in the noise correction process and on 
the way it should be handled in the covariance matrix. 
However, it has recently been shown that the residual B- 
mode on the ViRMOS-Descart data could be further cor- 
rected and eventually set to zero. This result follows from a 
better understanding of the PSF variation across the CCD 
chips fsee IHoekstra 20041 and IVan Waerbeke et al. 2004|) 
and it is expected that the CFHTLS will be at least as 
good as ViRMOS-Descart in terms of image quality, hence 
we set B-modes to zero in this work. 

On the other hand, our exploration of cosmological 
parameters is more realistic and accurate than previ- 
ous studies. We use a non-diagonal data covariance ma- 
trix, allowing for correlation between scales. Furthermore 
the MCMC method probes the real shape of the pos- 
terior PDF, without assuming a gaussian distribution. 
It is also important to stress that the 2-points correla- 
tion functions do not contain all the cosmic shear in- 
formation. In particular, we did not use higher order 
statistics, like the already detected convergence skew- 
ness l|Pen et al. 2003|l or the three point shear correlation 
functions IjBernardeau et al. 20 02). that may help also to 
break degeneracies IjBernardeau et al. 1997(1 . 

We explored the cosmological parameters space us- 
ing cosmic shear, describing the results in the context 
of a principal components analysis, and found a set of 
parameters degeneracies orthogonal to CMB ones. This 
led to predict a gain of the order of 2 or 3 for several 
parameters, when combining CFHTLS- Wide data with 



Size of the survey: 


A=1000 deg z 


Density of galaxies: 


n g = 50 arcmin -2 


Intrinsic ellipticity dispersion: 


cr E = 0.3 


Scales probed: 


0.6' < 6 < 5deg 




40 < I < 18000 



WMAP and CBI data. This means, for example, pre- 
cisions of cr(crg) = 0.03 and a(£l m ) = 0.02. This re- 
sult is consistent with the parameters determinations of 
IContaldi et al. 20031 that combines CMB data with the 
Red-Sequence Cluster Survey (RCS) data, where they 
found <t(<j 8 ) = 0.05 and c(f2 m ) = 0.03. From a quick 
Fisher matrix calculation, the ratio between parameters 
errors determined by 2 cosmic shear experiments is ap- 
proximately given by a ratio of the experiments survey 
areas (A) , galaxy densities (n) and intrinsic ellipticity dis- 
persions (a) (see for example |Hu fc Tegmark 1999): 



9c 



Ai n{ a\ 
A 2 n\ o\ 



(27) 



5 This is mainly due to the masking process that reduces by 
about 20% the total sky coverage of deep surveys 



The ratio between CFHTLS- Wide and RCS (with a size 
A = 53deg 2 ) is 1.8. 

As compared to the fiducial reference survey used by 
Isha k et al. 20 03. this simple estimation predicts the pre- 
cisions obtained with their reference survey should be 4 
times better than the ones from our configuration. But we 
find the same, or only slightly bigger, a values. First of all, 
Eq. I|27l) gives only an upper limit for the combined gain. 
In fact, for a parameter whose determination is dominated 
by CMB, the cosmic shear contribution to the joint result 
must be smaller than this value. The main reason for this 
discrepancy is, however, the fact that our degraded con- 
figuration, as compared to the survey in llshak et al. 2 003 
is partly compensated by the inclusion of smaller angular 
scales. In fact, we saw that it is in the non-linear regime 
that lies, not only the cosmic shear greatest sensitivity 
to the cosmological parameters, but also the cause of its 
orthogonality to CMB. 

7. Conclusions 

The CMB/cosmic shear complementarity opens good 
prospectives for the determinations of cosmological pa- 
rameters by combining CMB and cosmic shear data sets. 
In fact, even for CFHTLS, whose contours are, in general, 
noticeably larger than the WMAP+CBI ones (Fig.|J|), we 
predict non neglectable gains. Figure UTI shows what can 
be expected with future space telescope data. These re- 
sults were produced with a cosmic shear Fisher matrix 
analysis, using Eq. (|17fl and the fiducial model of Tabled 
(except for the redshift of the sources which was moved to 
z s = 1.1). For this illustration, the data covariance matrix 
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Fig. 11. 68% and 95% C.L. for the 4 most orthogonal 
cases found in section 5. Blue is WMAP+CBI data and 
red shows predictions for the wide field space telescope - 
cosmic shear parameters of Table © (99 % C.L. is also 
shown in this case). 

of Eq. UlUfl was computed for the configuration shown in 
TablcEl which is close to the SuperNova Accelerator Probe 
/ Joint Dark Energy Missio n (SNAP/JDEM) "Wide+" 
case of |Refregier et al. 2003| The CMB ellipses are plot- 
ted from the parameters covariance matrix found with our 
WMAP+CBI chains. 

In summary, we found the best constrained parame- 
ters by 2-point cosmic shear correlation functions to be 
cr 8 ^m 5 an( A Tn°- 6 (with T = VL m h). We have shown that 
2-dimensional degeneracies defined by these parameters 
plus another one defined by n s and a s are orthogonal to 
CMB degeneracies. Due to this CMB/cosmic shear com- 
plementarity, current weak lensing surveys, such as the 
CFHTLS, already have the potential to improve the pre- 
cision on several cosmological parameters. In particular, a 
better knowledge of a s will have an impact on inflationary 
scenarios. The crucial information provided by the cosmic 
shear comes from the small scales it probes. Thus, it pro- 
vides an additional possibility, along with galaxy redshift 
surveys and Lyman-a forest, to combine with CMB data. 
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